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An extension of quantum mechanics to a generalized Nambu dynamics leads to a new version 
of nonlinear quantum mechanics. The time evolution of states is given in here by a triple bracket 
generalization of the Liouville-von Neumann equation, where one of the generators is an average 
energy, and the other is a measure of entropy. A nonlinear evolution can occur only for mixed states, 
and for systems that are described by Renyi a-entropies with a 7^ 2. The case a = 2 corresponds 
I to ordinary, linear quantum mechanics. Since a = 2 entropy is the only entropy characterizing 

systems which cannot gain information, the nonlinear dynamics corresponds to "observers", that is, 
systems that can gain information. The new formulation of nonlinear quantum mechanics is free 
from difficulties found in earlier attempts. The connection of linearity with possibilities of gaining 
C ■ information is in a striking agreement with the ideas of Wigner formulated in his paradox of a friend. 

> ! I. INTRODUCTION 

o 

■ In the first part of this article I have described the fundamental theoretical difficulties of nonlinear quantum 
mechanics (NLQM) based on a nonlinear Schrodinger equation. In the present paper I will present a different 

■ generalization of quantum mechanics (QM) where the nonlinear evolution never occurs for pure states (the Schrodinger 
' equation is hence always linear in this framework) but, instead, may appear, under some circumstances, for mixed 

states. We will see that the new approach will be free from the difficulties discussed in Ref. 
f~i The proposed generalization is based on the idea of rewriting the Liouville-von Neumann equation in a triple bracket 

C^, form, introduced by Bialynicki-Birula and Morrison The triple bracket is an infinite dimensional analog of the 
Nambu bracket 1^ where, as opposed to the structure constants tum of the rotation algebra appearing in the original 
Ch ' Nambu bracket, the structure constants correspond to some infinite-dimensional Lie algebra. In the original Nambu 
paper an evolution of a physical system (a rigid rotator) is generated by two "Hamiltonian functions" , the energy H 
and J, where the latter is the Casimir of so(3) (squared angular momentum). The metric tensor used for constructing 
the Casimir is, as usual, the one related to the Killing form In the triple bracket formulation of QM the analog of 
J is the Casimir S = l/2Tr(p^) which also can be written as g°'^PaPb although, as we shall see later, the metric g""^ is 
no longer given by the Killing-Cartan tensor (which does not exist in this case). The Casimir S was termed in Ref. 
^ the entropy. It will be argued below that the assignment of the name "entropy" to S should not be regarded as 

, accidental, but as a reflection of a deeper principle relating dynamics with information. 

The fact that some sort of such a relationship should be present in QM follows already from the Copenhagen 
interpretation of a measurement (reduction of a state vector), but my approach will be essentially different and closer 
in spirit to Wigner's paradox of a friend Let me recall that Wigner, in order to solve the paradox, concluded that 
a conscious observation must be accompanied by a nonlinear evolution in the space of the observer's states. Even 
though the argumentation of Wigner looks convincing, it seems that standard quantum theories do not leave room 
for a physical principle of that kind. It is surprising that the triple bracket formalism does lead quite naturally to this 
phenomenon, if wc seriously treat the intuitions of Bialynicki-Birula and Morrison that S is & measure of quantum 
entropy. 

Putting things more modestly, one can say that the results of this paper, even if their interpretation will turn out 
inadequate, show that the structure of quantum dynamics may be a part of a more general, nonlinear framework. 

The structure of the paper is the following. In Sec. II I discuss various measures of information (entropies) and their 
possible relationship with quantum mechanics. With this background we will be able to understand why Tr(p^) is a 
natural measure of information characterizing systems that cannot gain information (and, accordingly, for which the 
paradox of a friend cannot be formulated). In Sec. Ill, I present a triple bracket formulation of QM and introduce the 
composite index form of the equation of motion. In Sec. IV I will discuss an extension of the formalism to nonlinear 
theories in which the second "Hamiltonian function" , here interpreted as a measure of entropy, is represented by 
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Casimirs of order higher than 2. Such Casimirs correspond quite naturally to higher order a-entropies discussed in 
Sec. II. The main technical results of this section are theorems on non-existence of "faster-than-light telegraphs" and 
on conservation of positivity of pt by the nonlinear evolution equation resulting from the triple bracket formalism. It 
is also shown that homogeneity preserving generalizations of a-entropies lead to linear evolution of pure states, and 
that the a ^ 1 limit of a-cntropics (the Shannon limit) can be regarded as a kind of classical limit for the generalized 
QM. Finally, I discuss various possibilities of describing composite systems that consist of subsystems described by 
different entropies. The paper is concluded with a remark on the complementarity principle in nonlinear QM and an 
explicitly relativistic formulation is given in the Appendix. 



II. MEASURES OF INFORMATION AND QUANTUM MECHANICS 

A logarithmic measure of information was introduced by R. V. Hartley in 1928 According to him, to characterize 
an element of a set of size N we need log2 N units of information. It follows that a unit of information (1 bit) is the 
amount of information necessary for a characterization of a pair. Of course, one can choose also other units such that 
the unit is the amount of information necessary for a characterization of a set with < fc g N elements, or even with 
< r e R elements in average. The respective measures of information in arbitrary units a are \og^ N . The most 
important feature of the logarithmic information measure is its additivity: If a set E is & disjoint union of M A^-tuples 
El, ... , Em, then we can specify an element of this MA^-element set E in two steps: First we need log^j M units of 
information to describe which Ek of the sets £'i, . . . , Em contains the element, then we need log^ N further units to tell 
which element of this E^. is the considered one. The information necessary for a characterization of an element of E is 
the sum of the partial informations: log^ MN — log^ M + log^j N . Next step in the developement of the measures of 
information was done independently by C. E. Shannon and N. Wiener [|| in 1948 who derived a formula analogous 
to Boltzman's entropy. Their formula has the following heuristic motivation. Let E be the disjoint union of the sets 
El, ... , En having Ni, . . . , Nn elements respectively = A^)- Let us suppose that we are interested only 

in knowing the subset Ek- (This is typical for classical statistical problems in physics: Statistical quantities depend 
on classes of microscopic conditions and not on single microscopic properties.) The information characterizing an 
element of E consists of two parts: The first specifies the subset Ek containing this particular element and the second 
locates it within Ek. The amount of the second piece of information is, by Hartley formula, log^j A'^^ thus depends 
on the index k. On the other hand, to specify an element of E we need log^ A'^ units of information. The amount 
necessary for the specification of the set Ek is therefore 

A^ 1 

Ik - log, A^ - log, Nk = log, — = log, — . (1) 

A'fe Pk 

It follows that the amount of information received by learning that a single event of probability p took place equals 

Hp) ^ log,-. (2) 
p 

In statistical situations measured quantities correspond to averages of random variables. Therefore the average 
information is 



/ = ^Pfclog,— . (3) 



Pk 

This is the Shannon's formula and / is called the entropy of the probability distribution {pi,...,p„}. If all the 
probabilities are equal 1/A then the Shannon's formula is equal to the Hartley's one. The mean we have applied 
is the so-called linear mean. Renyi observed that there exist information theoretic problems where the measures of 
information are those obtained by more general ways of averaging — the Kolmogorov-Nagumo function approach . 
Let (/? be a monotonic function on real numbers. The Kolmogorov-Nagumo average information can be defined by 
means of ip as 



\ k 



P.^flog.^) . (4) 



If the generalized information measure is to satisfy the postulate of additivity, Lp must be a linear or exponential 
function. The linear function corresponds to Shannon's information. The exponential functions provide a large class 
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of new measures of information. Consider a function ip{x) = a^^ . We can always choose the units of information 
in such a way that 

i = v-'\y nvllot.^] I =T^logJ> pSI =logJ (> P?)"""°'V (5) 




For pfc — 1/N we obtain again the Hartley formula. Formula ^ describes Renyi's a-entropy which, from now on, 
will be denoted laCP), where V denotes the probability distribution. We see that the essential part of the definition 
is played by 

which is independent of the choice of the unit a. To distinguish between a-entropy and lai'P) we shall call the latter 
a*-entropy (* will remind us that this quantity is multiplicative in opposition to the additivity of laiV))- (The 
observation that what is in fact informationally fundamental in lai'P) -^a(^) strenghtened by Daroczy's definition 
of entropy of order a pc| ] defined as 

(2i-"_l)-i(^p^_l). (7) 

k 

This expression possesses many ordinary properties of the entropy and in the limit a — > 1 becomes, the so-called 
Shannon's information function.) 

The limit a ^ 1 is interesting also for a-entropies. It can be shown that Ii — limc^i equals Shannon's entropy. 

lai'P) is a monotonic, decreasing function of a. For negative a laiP) tends to infinity if one of pk tends to zero. 
This property excludes a < because adding a new event of probability to a probability distribution, what does 
not change the probability distribution, turns laiP) into infinity. 

A fundamental notion in information theory is the gain of information. Consider an experiment whose results 
are Ai, . . . , A„ having probabilities pk = PiA = Ak). We observe an event B related to the experiment and obtain 
a result B — Bi. Now the conditional probabilities are pki = P(A = Ak\B = Bi). Consider now a system (an 
"observer" ) whose information is measured by some a-entropy. How much information about the random variable A 
has he received by observation oi B = Bil The amount of information he would have obtained by observing A = Ak 
would be equal to 

loga - (8) 
Pk 

if he had not measured B. After having observed B ^ Bi the amount of information he would have obtained by 
observing A = Ak would be 

loga— ■ (9) 

Pkl 

It follows that the measurement oi B = Bi has given him already 

loga — - loga — = loga — (10) 

Pk Pkl Pkl 

units of information about A. The expression (|l^) is called the decrease of uncertainty about ^ = by observing 
B = Bi. We define the gain of information about A, obtained when the probability distribution {pk} is replaced by 
{Pki}, by 

,-.(^^„„,(,og„^)).^,„,.(^|;)^ ai) 

If we define the increase of the uncertainty by minus decrease of uncertainty we can calculate the average "loss of 
information" defined by 
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For Shannon's entropy the gain is minus the loss. For a-entropies the two concepts are inequivalent. 

The gain of information defined by (|Tf|) for a > 2 has the same pathofogical properties as la for a < so, it 
seems, cannot be consistently applied unless we restrict < a < 2. This is the reason why Renyi defined the gain 
of information as minus the loss, although such a definition is less netural. From the viewpoint of our quantum 
mechanical applications the situation is not so clear, however, and the following argument shows that a = 2 is a 
natural value limiting a-s from above. 

When we speak about information, what we have in mind is not the subjective "information" possessed by a 
particular, animate observer. In reality the information contained in an observation is a quantity independent of 
the fact whether it does or does not reach the perception of the observer (be it a man, some registering device, a 
computer, or some other physical system). On the other hand, different kinds of entropies introduced above may be 
characteristic for different systems. The entropy (information) is objective in the same sense as probability, and in 
the same sense it is reasonable to expect that there are classical and quantum informations, as there are classical and 
quantum probabilities. 

The procedure leading to the notion of the decrease of uncertainty assumes implicitly that after each measurement 
of a random variable, here B, one can always proceed further in getting information about A, and that the procedure 
terminates when we know everything about the state of the system. In classical world this final state of knowledge 
means no uncertainties. Therefore, classically, if there is some lack of knowledge about a system, then there exists, in 
principle, a possibility of gaining information. Putting it more formally, we can say that an information characterizing 
a classical system should allow for different gains of information in different situations. The quantum mechanical 
no-hidden- variables postulate means that the probabilistic description of a quantum system does not follow from our 
lack of knowledge about the system. This suggests that a quantum information, characterizing a quantum system, 
might be of such a kind that its corresponding gain of information is zero under all circumstances. It is tempting to 
develop this hypothesis a little and find whether a measure of information possessing this property exists. 

The Shannon's information gain is given by 

-^p.,log.^ (13) 

and vanishes only if A and B are independent. So this case can be excluded because we want the gain of information 
to be for all probability distributions (this excludes also the von Neumann entropy). For a-entropies we find that 



the vanishing of (11) implies 

E4S = Ep^(^H"pr) = i (14) 

k Pk k 

which can hold for all pk and pki if and only if a = 2. It follows that the only candidate for the quantum entropy is 
the Renyi's 2-entropy which reads 



Pi)- (15) 

k 

Expressing the probabilities by means of a density matrix and choosing the unit of information with a = e we obtain 

/2[p] = -lnTr(p2). (16) 

This kind of entropy is sometimes considered as an alternative to von Neumann's entropy pT| | . Our reasoning, based 
on the assumption that an ordinary quantum system should not have a possibility of gaining information, selects this 
entropy in a unique way. It is clear, from the perspective of the Wigner's paradox of a friend, that observers, who 
can gain information, should be described by a 7^ 2-entropies. 



III. POISSONIAN FORMULATION OF QUANTUM MECHANICS 

A departure point for the discussed generalization of linear QM is the observation that quantum theory can be 
regarded as a particular classical infinite dimensional Hamiltonian, Poissonian or Nambu-likc theory. 
Let 7i be a Hilbert space. Consider the Hamilton equations 

u> (a, a I ; — c ,^ , — 77 (1') 
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and c.c, where the bars denote complex conjugations and the conventions concerning primed and unprimed indices are 
assumed hke in the spinor abstract index calculus fl^ . The summation convention is as follows: We sum over repeated 
Roman indices and integrate over repeated Greek ones. The integration is with respect to some invariant, or quasi- 
invariant measure on a finite dimensional manifold (mass hyperboloid, spacelike hypcrplane in the Minkowski space, 
etc.). The symbol of the "proper time" derivative describes a differentiation with respect to a suitable foliation of 
space-time (Minkowskian spacelike, or Galilean t =const hyperplanes, etc., see Appendix). In Hilbertian formulation 
of QM the "symplectic form" is given by the delta distribution 



UJ 



(a, a') := iS^"^' S{a, a') =: lu^^' Sia, a') (18) 



where 5^^' = 6ab' = 1 if A = B' and for A ^ B' in the nonrclativistic QM. For the Dirac equation S^^ and Sab' 
can be represented by the Dirac matrix 70, and the Dirac delta function must correspond to the choice of the spacelike 
hyperplane. In the projective space formulation the symplectic form corresponds to the Fubini-Study metric. The 
inverse of ui^^ {a, a') is 

lAA'{a,a') := -i6AA'S{a, a') =: lAA'S^a^a') (19) 
where by the inverse we understand that 

uj^^' («, a')lBA' (P, «') = S^S{a, 13) (20) 
LJ^^\a,a')lAB'{a,(3') = 5i5{a\(3'). (21) 

Accordingly, the form of the Hamilton equations wc shall use is 

d^Aja) , SH 
(XT o^)\,\a) 

and c.c. ( ^ describes a quantum evolution of pure states. All observables of the linar theory depend on ji/') and (^/jj 
via the density matrix p = |^/;) (?/;|. Let F and G be two such observables, that is F\i^, V*] = P\p\ ^^nd G\^, V*] = G\p\. 
The Poisson bracket resulting from the Hamilton equations is 



{F,G} ,5VA(a)5V'A'(a))' ^^^^ 



Applying the chain rule to the components of the pure state density matrix 

PAA-{a,cJ) = VA(a)V'A'(a') (24) 

we find that 

{F,G} = Iaa\ ^ '^f r,,. PcB'{l.0) , , - (i^ - G)). (25) 

\bpAB'\ot,^') 5pcA'{l,a) ' 



So long as the density matrix in (^5|) is given by ( |24[ ) the bracket is equivalent to the Poisson bracket (^3|). Jordan, 
in a context of the Weinberg's theory ||l^] and for a finite dimensional Hilbert space, investigated properties of the 
bracket (25) with p being an arbitrary density matrix. For reasons that will be explained below I will term such a 
general bracket the Bialynicki-Birula-Morrison-Jordan (BBMJ) bracket. 

We will now show that (p5|), for a general p, can be written in a form of a generalized Nambu bracket. Let p be 
arbitrary. The BBMJ bracket can be rewritten as 

{i^,G} =MA'(a,«')r!-^^Bi3'cc'(«,«',A/?',7,7')T ^1 c (26) 

0PBB'[P,P') OpcC'(l,r) 

with 

n^^BB'CC'ic^, a', /9, 7, 7') = S^S^Jsc'Sia, -f)5{a' , f3')S{f3, 7') 

-SiS^:icB'Sia,m^',l')Sil,P') 
= (27) 
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where, in analogy to the spinor calculus, we have clumped together the respective quadruples of indices into composite 
ones (a = {A, A' , a, a'), etc.). 

The "structure kernels" fi"^^ satisfy conditions characteristic for Lie-algebraic structure constants: 

= -^\c (28) 

and 

+ + = o (29) 

These two conditions imply the Jacobi identity. The composite index form of the BBMJ bracket 

{F,G} ^ Pan\J^f- (30) 

shows that it takes the same form as the generalized BBM-Nambu bracket written in terms of the Wigner function for 
a scalar field As a matter of fact, the BBMJ bracket is simply a different representation of the BBM bracket. The 
formula (pO)_looks much the same as the Poisson bracket related to the Kiryllow form on coadjoint representations of 
Lie groups HT^ (such brackets for general structure constants are called the Lie-Poisson brackets (cf. [p^)). 
It remains to find out how to formulate the explicit triple bracket equivalent to (|30|). 

In order to do this we first have to define a "metric tensor" to lower the upper index in the structure kernels 
(Bialynicki-Birula and Morrison avoided this difficulty because the field they considered had no spinor components). 
The apparently natural guess (the Killing-Cartan metric) 

gab = (31) 

is incorrect as (^ij) involves expressions like (5(0) which are not distributions in the Schwartz sense. 
The correct definitions are 

9ab = -Iab' (a, P')Iba' (/3, a) (32) 
g-^ = -u;^B'{a,(3')co^^'{(3,a'). (33) 

The metric tensor is symmetric 

gab = gba (34) 

and satisfies the invertibility conditions 

5"V = dcbg'" = S^S^:Sia, -f)d{a', 7') =: 5^ (35) 

The metric tensor is a useful tool. Consider for example a p-independent Fi, ~ Fbb'{Pi P')- Then 

F[p] = g-'paF, = p^'^,i(3',a')F^s,ia',(3') - TrpF (36) 

and we see that linear observables can be naturally expressed with the help of (|3^). This example is important also 
as an illustration of the convention concerning lowering and raising of indices. For notice that 

= F^'s,{a',l3') (37) 



Sp%{f3',a') 

although the staggering of indices like F^A (/?', a') might seem more natural. 
The fully covariant form of the structure kernels is 

^abc = -lAB'{(y,P')IcA'{l.a')lBC'{P.i)+lAc-{a.i)lBA'{(i.a')IcB'{l:P'). (38) 

One easily verifies that fJafcc is totally antisymmetric. 

Following Bialynicki-Birula and Morrison let us introduce the functional 

S2 = \g^'paPb=\TT{p\ (39) 
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which is one half of the inverse of Renyi's 2*-entropy. 

The BBMJ bracket is now equal to the following triple bracket 

{F, G} = [F, G, S2] = ^abc^^^-^. (40) 

OPa Opb dpc 

The antisymmetry of the triple bracket means that 6*2 is the Casimir for the BBMJ bracket Lie algebra of observables. 
Another Casimir is Tr p because {Trp, _F} — for any differentiable F (hence not only linear). The wave functions 
have been eliminated from the dynamical equations, but the Hilbert space background is implicitly present in the 
structure kernels and the metric tensor which are defined in terms of lu and /, and in the very notion of the density 
matrix which acts in the Hilbert space. 

Components of the pure state density matrix satisfy 

^p,^{p,,H}. (41) 

which holds also for general density matrices as can be seen from the familiar, operator version of the Liouville-von 
Neumann equation. It follows that the density matrices form a Poisson manifold, as opposed to state vectors that 
form a phase space. 



IV. NONLINEAR QUANTUM MECHANICS AS A GENERALIZED NAMBU MECHANICS 

The generalizations of quantum mechanics considered by Kibble and Weinberg are based on the Hamilto- 
nian framework. The nonlinear evolution is introduced through an extension of the class of admissible Hamiltonian 
functions. More generally, all canonical transformations are generated by a larger class of functionals on Hilbert 
or projective spaces. The functionals are a generalization of averages of observable quantities. This fact leads to 
the fundamental difhculty in constructing a probability interpretation of such theories: The generalized observables 
do not form an associative algebra which makes impossible a unique definition of powers of observables, the formal 
counterpart of higher moments of random variables measured in experiments. 

The triple bracket form of the Liouville-von Neumann equation shows that the time evolution in linear QM has, in 
fact, two generators: the average energy (Hamiltonian function) and the Casimir S, which measures Renyi's a = 2 
entropy (or, even more directly, Daroczy entropy of order 2). It is natural to ask what will be changed in the theory if, 
instead of generalizing the class of admissible Hamiltonian functions, we shall extend the class of entropies. A physical 
meaning of such an extension would be the one required by Wigner in his paradox of a friend: We extend quantum 
mechanics to systems that can gain information. The extended theory has a well defined probability interpretation, 
because the observables are represented by linear operators, provided the scaling by a constant, p — > Xp, is a symmetry 
of the dynamics. This imposes on the generalized entropies the 2-homogeneity condition: S{Xp) = X^S{p). 

Only for S[p] = l/2Tr(p^) the linear observables are closed under the action of the bracket {•, - js := [•, •, S']. If we 
extend the class of acceptable 5, we have to accept also a somewhatOB stronger form of the complementarity principle 
than in linear QM: Observables are always complementary to their (nonvanishing) time derivatives (see Sec. ^). We 
shall begin the discussion of the generalization with the question whether, for general S, the manifold of states is the 
Poisson manifold. 



A. The Jacobi Identity 

Let F, G, H and S be arbitrary twice functionally differentiable functionals. We consider the expression 
J = {{F, G}s, H]^ + {{H, F}s, G]g + {{G, F] ^ 
SF SG S 3H SS 

= T T T J -J -J i^def^abc + ^bdf^aec + ^ebf^adc) (42) 

Spd Spe SpaSpf Spb Spc^ 

which holds good for any S. — g°'^ for 8 = 82 and (^) vanishes in virtue of (|2^). For more general S — S{f2[p\) 

we find 

T- = 2— p= 43 

OPc 0/2 

= ^^P P^ + 2^3°^. (44) 



SpaSpf 9/1 df: 
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Inserting these expressions into ( |4^ ) we obtain 

SF SG 0^ S SU QS 

Spd Spe dfi 5pbdj2 II 
since VlabcP°'p'^ = 0. With this choice of S we obtain the dynamics given by 

^Pa = {Pa,H}sMp\ (46) 

where C[p\ — = C'(/2[p]) is an integral of motion, as we shall see later. The only difference with respect to 

ordinary QM would be in a p-dependent rescaling of time, a phenomenon that, in principle, might influence lifetime 
characteristics of physical processes. 

For more general S the question of the Jacobi identity is open, hence we have to accept the possibility that mixed 
states in the generalized QM do not form a Poisson manifold. This would not be surprising, since in various versions 
of generalizations of the Nambu mechanics, the Jacobi identity does not hold. 

B. Composite Systems in the New Framework 

Let the Hilbert space in question and the density matrix of some composite system heTL — TLi ® Ti.2 and 

Pa = pAA'{a,a) = PAxA^A'^A'^{ai,a2.,a-^,a2)- (47) 
The same doubling of indices concerns 

lAA'{a,a') = -i5A^A'^5A2A'^5{ai,a[)5{a2,a'2). (48) 
Reduced density matrices of the two subsystems are 

= ^'^"^"^(a2,a2)PAiA2A;A;,(Q:i,a2,a'i,a2) (49) 
("2,^2) = ^^"^'^(ai,ai)PAiA2A;A^(ai,a2,ai,a2) (50) 



and satisfy 



and 



5p\ 4, («!, a\ ) , „, 
' \ = jfj<5^=^^<5fM(/3i,ai)<5(/32,/3^).5(/3;,a'i) (51) 

OPBiB-iB'^B'^iPl, P2, Pi, P2) ^ 



5p^I (q;2, a'n) , „, 

— — =5|M^^^i4=^<5(/32,«2)<5(/3i,/30<5(/3^,ai). (52) 



5pBt B2B[ B'^ (/3l , /32 , , /32) 

The structure kernels for the composite system are 

^abc — ^a-ia^b-ib^c-ic^ 

= (<5aiS; ^Ci a; ^BiC[ ^A2B'^ ^C^A'^ '>B2C: 



d{ai,(3[)S{^i,a[)6{pi,j[)d{a2,P'2)S{j2,a'2)S{p2,l'2) 

- ^ AxC\^ BxA\^CxB\^ A^Cl^^B^A'^^C^B'^ X 

5(ai, 70^(/3i, a'i)J(7i, /?i)^(«2, 72)'5(/32, 4)5(72, /32)) ■ 



(53) 



The following two results solve generally the question of faster-than-light telegraphs in both Hamiltonian and triple 
bracket frameworks. 
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Lemma 1 Reduced density matrices of the subsystems satisfy 

nabc^^-^^O. (54) 

OPa Opb 

Proof: It is sufficient to contract (|5^) with (|5T| ) and ([5^).D 

Theorem 2 Let F = F[p^] and G = G[p^^], that is depend on p via ( ^9[ ) and (|50|), then for any S 

{F, G}s = 0. (55) 

Proof: By virtue of the lemma one has 

dpa dpb 6p'^^ Spf^l Spc 

□ 

Notice that we have not assumed anything but differentiability not only about S but also about F and G. So, in 
particular, for arbitrary (nonlinear) observables and 5 = 52 we obtain the Polchinski- Jordan result for Weinberg's 
nonlinear QM. 



C. Density Matrix Interpretation of Solutions of the Generalized Evolution Equation 

One of the essential questions we have to clarify concerns the density matrix interpretation of the solutions of the 
generalized Liouville-von Neumann equation 

■^Pa=[Pa,H,S]. (57) 

dr 

There is no general a priori guarantee that the generalized dynamics will conserve positivity of p. The next theorems 
will give a partial answer to this problem. 

In order to attack the question we have to make the language of the 5-brackets more readable. Consider the triple 
bracket [F, G, H] of arbitrary functionals F, G and H. We find that 

.[F,G,H]= 



6p%{f3',a') 5p%,{^',P') 5p^^,(a',7') 

SF SG SH 



(58) 



Sp^'^,{y,a') Sp\,{a',P') 5p^'c,{P',i'y 
Applying the notation of (p^) (where now the "operator" kernels are in general p-dependent) we transform (pq) into 



F^s.{a\[3')G'''c.{P\iW'Ai.^') 

- F^c' 7')G^A' c^')H^B' (y, /3') = Tr ([F, G]H) . (59) 

In the last line we have introduced an abbreviated convention based on the assignment to any functional F of an 
operator 

SF 

F^— (60) 
which is defined by the kernel form used in (|59|). For example 

P-^, (61) 

and 
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the latter being the shortened form of 

STt (p") 



SpAA'ia,a') 



^=V-', (62) 
dp 



X S{p'„, a)S{a', P2)S{p'2,P3) ■ • ■ <5(/3;_i, /3n) 

X PB,B' iP2,P'2) ■ ■ ■ PB„B' (/3n, P'n)- (63) 



The first of these implies the known result 

[F,G,S2] = -tTr{p[F,G]) (64) 
leading to the von Neumann/Heisenberg equations for states/observables in linear QM 

^TT{pF) = -iTr{p[F,H]). (65) 

The same equation is valid in the Polchinski-Jordan density matrix formulation of Weinberg's NLQM [ p^p^ , but 
then F — F[p], etc. Consider now a functional S (differentiable in fk) 

S[p]^S{Mp],...Jn[p],...) (66) 

where fk[p] = Tr(p'=). 

Theorem 3 For any to G N, and any G, if S satisfies ( |66|) then 

[f^,G,S]=0. (67) 



Proof : 



[Tr (p"), G, S] = 5][Tr (p™), G, /„]— = -zto ^ nTr {G[p^-\ p-^]) — = 0. (68) 



□ This interesting result covers many nontrivial generalizations of 52- As a by-product it shows also that the 
same property holds for the Weinberg-Polchinski- Jordan NLQM because we have not assumed that G is linear in p 
(moreover, it includes other theories where observables do not satisfy any homogeneity condition). The particular 
case TO = 1 implies that Trp is conserved by all evolutions, a fact important for a definition of averages. For pure 
states Tr(p™) = (Trp)™ so that the integrals fm are not necessarily independent, but for all m,n and /„ are in 
involution with respect to {•, ■}s- Jordan proved in by an explicit calculation that in his formulation of Weinberg's 
nonlinear QM Trp and Trp^ are conserved — our theorem considerably generalizes this result. 



Theorem 4 Let S satisfy (66) and pt he a self-adjoint solution of (|57|). // po is positive and has a finite number of 
nonvanishing eigenvalues pk{0), < Pfc(O) < 1, then the eigenvalues of pt are integrals of motion, and the evolution 
conserves positivity of pt ■ 

Proof: Since the nonvanishing eigenvalues of po satisfy < Pfc(O) < 1 < 2, it follows that for any a Pk{0)°' can be 
written in a form of a convergent Taylor series. By virtue of the spectral theorem the same holds for Pq and Tr (pg ). 
Each clement of the Taylor expansion of Tr (pq ) is proportional to fn[po\i for some n. But /„[po] — fn[pt] hence 

Tr (p^) = Tr {pf) = ^p,.(0)" = ^Pfc(t)" (69) 

fe k 

for all real a. Since all pk{0) are assumed to be known (the initial condition), we know also J^kPki^)" — J2kPk{t)" 
for any a. We can now apply the result used in the information theory Q stating that the knowledge of X]fc-Pfc(0" 
for all a uniquely determines Pk{i)- The continuity in t implies that pk{t) — PkiO). □ 
The spectral decomposition of the density matrix 
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Pt = ^Pk\k,t){k,t\, 

k 



(70) 



where t i— > |fc,t) defines a one-parameter continuous family of orthonormal vectors, leads to the unitary (although 
/9-dependent) transformation \k,t) = U{pt, pa)\k,0) . The density matrix evolves then as follows 

Pt = U{pt,po)poU{pt,po)~'^. (71) 

The question whether the same holds good for pQ having an infinite number of nonvanishing eigenvalues will be left 
open here. In any case, it seems that the above theorem is sufficient at least "for all practical purposes" . 

To make our proposal more concrete, we have to choose some explicit "physical" class of S* — and here the 
information theoretic introduction may be helpful. 

The suggestion of Wigner that a natural arena for nonlinear generalizations of the linear formalism of QM is the 
domain of observations leads to investigation of systems that can gain information hence are described by a 7^ 2 
entropies. A homogeneity preserving generalization of S2 for other a-entropies can be, for instance. 

The choice of the denominator is important only from the point of view of the homogeneity of the evolution equation. 
The multiplier 1 — 1/a guarantees that the evolution of pure states is the same, hence linear, for all a ( this is 
reasonable as pure states have the same, vanishing a-entropies). The generalized Liouville-von Neumann equation 
following from (72) is 

V- (Trp)i/(-i)-i [^'^ (^^^ 

For pure states and Trp = 1, p" = p and the equation reduces to the ordinary, linear one; for mixed states the 
evolution is nonlinear unless the states are "so mixed" that p is proportional to the unit operator (which makes sense 
in finite dimensional cases, of course) and all a-entropies reduce to the Hartley formula. 
The evolution of (now linear) observables is governed by 

which shows that for the generalized S the time derivative of an observable is not linear in the density matrix. For 
a — 2 the equations reduce again to the ordinary linear equations. 
It seems that the following choice of Sa is also interesting: 

1(^(P^))^ 

^"[^] = 2 (Trp)V(^-iKi - (^^^ 

For pure states the expression reduces to the linear form ^(t/'It/')^ = ^Tr (p^). The density matrix would satisfy then 
the equation 



1 a (Tr(p")) 



l/(a-l)-l 



-d7^=2^ (T:p)V(^-i)-i [^'^ ] ^''^ 



which for pure states and normalized p would become 

a — 1 d - , , , 

2 ^ p^[H,p] (77) 

a dT 

and the "Boltzmann-Shannon classical limit" a — + 1 of the Renyi entropy is indistinguishable from the ?i — > classical 
limit of QM. 
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D. Composition Problem for Subsystems with Different Entropies 



Assuming that the formaUsm is apphcable to a description of the composite "object+observer" system, where the 
nonlinearity is a feature of the observer, we have to know how to combine systems that are described by different 
entropies. 

I think it is best to approach the question again in an information theoretic way. To begin with, let us consider a 
system whose entropy is la, and whose subsystems have entropies of the same kind: for all k a fc-th system's entropy 
satisfies /q,^ = la- Let the fc-th subsystem be described by a reduced density matrix pk- The entropy of the "large" 
system should be defined, as usual in information theory, as the average entropy of the subsystems. The overall 
entropy of the large systems should not depend on the way we decompose it into subsystems. Therefore the average 
cannot have the apparently natural form 

n 

Ia,...aAp\ ^^PklaiPk], (78) 
fc 

where pk are some weights, because the LHS is sensitive to correlations between the subsystems whereas the RHS is 
not, so that the entropy would be sensitive to the decompositions which are arbitrary. It seems we have to assume 
that in such a case the composition takes the trivial form 

Ia,...aAp] = ^PklaAp] ^Pklc,[p] = Ia[p]- (79) 
k k 

Consider now a situation where the different subsystems have different entropies, say, la,. ■ The average entropy of 
the composite system is now defined in analogy to (^9|) as 

/oi...a„[p]=^Pfc/aJp] (80) 

where the probabilities pk are weights describing the "percentage" of each of the entropies in the overall entropy of 
the system. We do not know how to determine the weights — they can play a role of parameters characterizing the 
system. 

The above definitions imply that the Q;*-entropy of the large system is 

c..o„[/'] = n^ajpr, (81) 

Oik 

so it is natural to define 

^o....a„[p]=n^"'=['^r- (82) 

Oik 

Denoting the latter expression by S, we obtain 

d S 

-j^Pb^^Pk[Pb,H,Sak] — - (83) 

Consider again a system which consists of subsystems equipped with the entropy of the same kind. Then Sak — Sai — 
S for all k and I and the system evolves according to 

^p,^[p,,H,S] (84) 

as expected. 

The next possibility is that the entropies that sum to the overall entropy are again sums of some other entropies. 
The description of the whole system should not depend on the order in which the partial entropies are summed up. 
So consider two entropies and S^^ , with appropriate weights A''^ and A''^^, and let the entropies and S^^ consist 
of some other entropies and Sf^ appearing with weights {pi}k=i ^'^'^ {pf^lfin respectively. Then 

^pt = J2 ^ VJP., H}ss §T + Y1 ^"p'l'iP'^^ H}su A (85) 

k ^ I ' 
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which shows that the evolution can be indeed consistently composed of "sub-entropies" . 

If all Sa^ip] depend on p only via fm[p], like in our definitions ( fz^ ) and (fz^), we know that on general grounds 
they are integrals of motion. Consider a subsystem described by pk and which is noninteracting with the subsystem 
"where the nonlinearity resides". In such a case the overall Hamiltonian function is H[p] = J^k Hk[pk] and 

d S 

-i-Pkb^^Pi[pkb,Hk[pk],Sai\-^. (86) 

A system described by pk and Sa^ can be totally isolated from the "rest of the Universe", if for k I, 
[pkb, Hk[pk], Soil] — ^^'^ Pk = Sa,^/S. However, even in such a case the global properties of the large system 
leave their mark on the local properties of all the subsystems as pk S/ Sq^. is at most an integral of motion hence 
depends on initial conditions. Consider a general H (including the interaction) and let 

S[p]^Soi,...o,Jp]^l[SM'"'- (87) 

where the different subsystems have different entropies. Each of the sub-entropies satisfies 

^SM=T.P'^^M,H,SM]^^0 (88) 

in virtue of the theorem || (we have used here the fact that {F,H}s — —{F,S}h), and the antisymmetry of the 
triple bracket. Therefore not only the overall entropy, but also the sub-entropies are integrals of motion even if the 
Hamiltonian function H contains interaction terms. 

Consider now the reduced density matrix pk of one of the subsystems. Using the same theorem we find that 

d S 

-j-Pkb = Pk[pkb,H,Sak]-^- (89) 

where S/ So^. is an integral of motion but its value depends on initial conditions. If the change of the initial conditions 
does not affect the reduced density matrices in (|8^), the integral of motion is also unchanged. Therefore in order to 
change this quantity we have to change correlations between the subsystems. In particular, a time dependence of a 
linear system which is noninteracting with the nonlinear one is insensitive to changes of initial conditions within the 
linear system if the particular form ( |87| ) holds. For global entropies different from ( p7| ) some kind of sensitivity appears 
but the influences between the subsystems cannot propagate faster than light unless we introduce the projection 
postulate. 

Such a trace of nonlinearity observed in some linear system might be used to detect the nonlinearity. Following 
Santilli |l9[| we can expect that an evolution of an internal part of a hadron may be nonliner (like in hadronic 
mechanics). In such a case correlations between a hadron (say, a proton) and some linear system (say, an electron) 
could be observed in a form of a p-dependent rescaling of time in the electron's evolution. 



V. COMMENTS 

S. Weinberg wrote in ||2^ that the "theoretical failure to find a plausible alternative to quantum mechanics, even 
more than the precise experimental verification of linearity, suggests (...) that quantum mechanics is the way it is 
because any small change in quantum mechanics would lead to logical absurdities. If this is true, quantum mechanics 
may be a permanent part of physics. Indeed, quantum mechanics may survive not merely as an approximation to 
a deeper truth, (...) but as a precisely valid feature of the final theory." This kind of conviction followed from the 
internal theoretical difficulties of the generalizations based on nonlinear Schrodinger equations and general Hamiltonian 
framework. These difficulties have been discussed in detail in 

The proposal based on the generalized Nambu dynamics is free from those difficulties. However, it has new features 
with respect to ordinary QM. One, the stronger complementarity principle, has already been announced. In the 
generalized framework a time derivative of an observable will not, in general, be linear in the density matrix. Since 
we have defined observables as functions necessarily linear in p, the time derivatives of observables are not themselves 
observables. 

I propose the following interpretation of this fact. To focus our attention let us consider linear QM and the 
nonrelativistic position operator. An average velocity of an ensemble of particles can be calculated either by first 
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calculating an average position and then taking its time derivative, or by first measuring the velocity of each single 
particle and then taking the average. We can say that the first procedure is a calculation of the time derivative of an 
average, whereas the latter is taking the average of the time derivative. The situation can be described symbolically 
by the equation 

An important property of QM is the impossibility of realizing the two procedures simultaneously, as v = p/m and q 
are complementary. It follows that in a concrete experiment we have to decide which way of measuring to choose. In 
this meaning if we can measure q, we cannot measure -^q, and vice versa. To express it differently, if (f is observable 
(not an observable!) then ^9 is not. 

In triple bracket NLQM the observables will be defined as quantities that are in one-to-one relationship to some ex- 
perimentally measured random variables (hence the linearity in p) . Two observables will be said to be complementary 
if there does not exist a physical situation where the two respective random variables can be measured simultaneously, 
that is, in a single run of an experiment. There can exist linear operators representing the position and the velocity of 
single members of an ensemble, but if the ensemble evolves in a nonlinear way, the averages of those observables do not 
have to satisfy the inherently linear condition ( pO[ ) , if the experimental procedures necessary for their measurements 
cannot be simultaneously realized. 

Another fundamental problem, arising in the Nambu-like description, is the action principle leading to the triple 
bracket equation. The Hamiltonian NLQM proposed by Kibble or Weinberg can be derived from the ordinary 
Lagrangian formalism. The triple bracket form of dynamics must follow from a new kind of variational principle. The 



variational principle proposed recently by Takhtajan 21 1 suggests an interesting direction for further investigations. 



VI. APPENDIX: HAMILTONIAN FORMULATION OF THE DIRAC EQUATION 

Consider the Dirac equation 

(i7°Va - = (91) 

where Va = da + ie^a and is an electromagnetic potential world-vector. We are going to rewrite the equation 
in a form of the "proper-time" covariant Hamilton equations of motion. The "proper time" will be defined in terms 
of spacelike hyperplanes constructed as follows. Let CTT-(a;(r)) = be an equation defining a family of spacelike 
hyperplanes. The field of timelike, future-pointing, normalized vectors n'^{x) oc d°'a-r{x), satisfying the continuity 
equation dan'^{x) — 0, defines the field of "proper time" directions. Integral curves r x°'{t) of n^{x), where r is the 
parameter of the family {ctt}, play the role of the world-lines. We shall need the continuity equation to guarantee the 
reality of the Hamiltonian function. Notice that this condition eliminates some physically meaningful hyperplanes, 
like the proper-time hyperboloid ar^x) = x°'Xa — = 0, but admits simultaneity hyperplanes Orix) = n°-Xa — t = 0. 
The "proper time" following from the construction should not, for this reason, be identified with the ordinary proper 
time of the electron. The "proper time" derivative at x is defined as 

4- = K{^)da. (92) 
CLT 

Multiplying (|9l| ) from left by the Dirac matrices we obtain 

(iV + a^^Vb ~ m-i'')i; = 0. (93) 

Writing the four-potential explicitly in 

id'^ii; = {-a'^^db + e$" + iea^^^b + ^7^)^' (94) 

and contracting with n'^{x) we get 

i^V(a;) = (-cTabK{x)d'' + e<(a;)$„(a;) -t- ieaahK{x)^^ {x) 



+ m<(x)7ajV'(a;) (95) 
Hi}{x). (96) 
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In the spinor language 

zV^^>^=MX-^' =zg,^^'V'^0A (97) 
«Vaa'X^' = = «5aAA' V^x^' (98) 

where /x = vi/\/2 and f?^^ '^"^ are the Infeld-van der Waerden symbols |12|. Using the identities 



a bYA' 
XA'9 


+ 9' 


aYA' 

XA'9 


ab^ Y 

= 9 £x 


a bYA' 
XA'9 


-9' 


aYA' 

XA'9 


A^ab Y 


a bAY' 
AX'9 


-9' 


aAY' 

AX'9 


A}^ab Y' 
= 4ct X' 



(99) 
(100) 
(101) 



where fj"'^ x^ and fj"'^ x^ generators of (^, 0) and (0, i) representations of SL{2, C) we obtain 



iVaCj^X = -^l(7^kx^V''(j)Y + 2ngaXX'X^' (102) 
i^aX""' = m,,Y^'w'>x'^' -^f^9a''''''f>X (103) 

and the equations obtained by their complex conjugation. These equations are especially simple if we express 
generators and Infeld-van der Waerden symbols in purely spinorial terms. Remembering that n° Uar = 1 implies 
nAA'T n^"^ = \ej^^ we get after some calculations 

i-^4>x{x) =in^^ {x)VxY'4'Yix) + nnrxx'{x)x'^ (x) 
dr 

+ en'!,{x)<^a{x)(px{x) (104) 

i^X^'ix) ^inrYY'{x)V^^'x^\x) ~ fin^^'{x)q^xix) 
dr 

+ e<(a:)$,(a:)x^'(x) (105) 

Let dar{x) be some invariant measure on the hyperplane ar- The equations can be derived from the Hamiltonian 
function 

{i <f>*x, (a;)nf ^' {x)n^^' (x)Vxy' (t>Y (x) 
- iX*"" ix)nrXX'ix)nrYY'ix)V''''' x""' (x) (106) 



+ {rx'i^)x'''ix)+x*''ix)q^xix)) 

+ e < {x)^a (x)n^^' {x) ((j)x {xWx' {x) + Xx {x)xx' (a;)) }dcr^ (x) 



provided 



d'''''n^xx'{x) - d'''''nrXY'{x) - (107) 



and the wave functions vanish at boundaries of the hyperplane ar- Reality of H is guaranteed by the same conditions. 
The Hamiltonian function is n ot positive definite, which is correct since we are working here in first quantized 



formalism. Contraction of (107) over the remaining indices implies the continuity equation discussed above. 



The explicit form of the Hamilton equations is 



.nf-'(x)A^,(,)._^, (108) 
.n.x.'(x)|:x"'(.x) = ^^, (109) 
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and c.c, or, in the Poissonian way, 



^|:0x(.) = 2n...,(.)^, (110) 
4x-'(.)-2nf-'(x)^^, (111) 

We can see that i n-^xx' {x) — uJtXX' {x) are the components of the symplectic (since derivable from a Kahler potential 
II tp IP) form on CTt- at point x G Ut, and the Poissonian form ItXX'{x) = —2inTXX'{x). 

Let "ia^, a = 0, 1, 2, 3, be the Dirac matrices. The Hamilton equations equivalent to the Dirac equation written in 
the bispinor form are 

-?(^)7r^^.(^) = ^- (112) 

and c.c, where * denotes the complex conjugation. The formulas are simplest if we take simultaneity hyperplanes 
foliation of the Minkowski space. Then n'^{x)j2^ = lo^ ■ Denoting its inverse by 7oq/3 we find that 

d ^ , , 6H , , 

^^.(x)=-z7o./,^ (113) 

^C(-)=^7o.^^ (114) 



So here the Poissonian form is lap = —i^oap and the Dirac matrix 70 corresponds to 5ab' discussed in III, The 
transition to the triple bracket formalism is now straightforward. 
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